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Abstract. We show any subset A C N with positive upper Banach density 
contains the pattern {to, to + [not], . . . , to + fc[na]}, for some to G N and n — 
p — 1 for some prime p, where a G R\Q. Making use for the Furstenberg 
Correspondence Principle, we do this by proving an associated recurrence result 
in ergodic theory along the shifted primes. We also prove the convergence result 
for the associated averages along primes. We also indicate other applications of 
these methods. 

1. Statement of Results 

1.1. Introduction. A measure preserving system (X, X,fi,T) consists of a prob- 
ability space (X, X, fi) and a measurable, measure preserving transformation T 
acting on it. Throughout this paper, we assume T is invertible. The limit behav- 
ior (existence and positivity) in L 2 (fi) of a measure preserving system (X, X, /i, T) 
of multiple averages of the form 

i N 

*! im Tr E fi(T ai{n) x) ■ . . . ■ f k (T a ^x) 

iV— >oo /V z — ' 

n=l 

has been widely studied in recent years for different choices of functions a;(n), 
where a^n) : N — > N are integer sequences (see, for example, [T], [TU], [H], [H], [IB], 

HQ- M)- 

In this paper, we study similar results for averages over shifted primes, meaning, 
the primes or the primes plus or minus 1. Let P denote the set of all primes and 
ir(N) denote the number of primes up to N. For all x G R, {x} denotes the 
fractional part of x and [x] denotes the largest integer which is no larger than x. 
We prove the following theorem: 

Theorem 1.1. Let k G N,a G R\Q, (X, X, fi, T) be a measure preserving system. 
Let fi, . . . , f k G L°°(/i). Then the average 

1 T^h-...-T k ^f k 

^ ' P E[l,N]nf 

converges in L 2 (fi) as N — >• oo. The conclusion is also true if i[na] is replaced by 
[ina] for i — 1, . . . , k. 

l 
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Frantzikinakis, Host and Kra jS] first proved for the linear polynomial case of 
Theorem ll.il (i.e. the case all i[na] are replaced with in), with the proof of the case 
k > 3 conditional upon the results of [T3] and [IB] that were subsequently proven. 
Then Wooley and Ziegler [23] proved the convergence result for the polynomial case 
(i.e. the case all i[na] are replaced with some polynomial with integer coefficients) 
for a single transformation. After that, these results were generalized to the multi- 
dimensional polynomial case for commutative transformations by Frantzikinakis, 
Host and Kra [9]. It is natural to ask whether one can prove a version of these 
theorems for generalized polynomials. Theorem 11.11 extends these results to linear 
generalized polynomials. 

By what is now referred to as the Furstenberg Correspondence Principle, first 
introduced in [TO], the positivity of 

J f .T ai ^f ■ ...-T ak{n) fdfx 

for some / > not identically is equivalent to the positivity of the upper Banach 
density d* of the set 

d* (En(E- ai(n)) n • • • n (E - a fc (n))) 

for some E C N, where d*(E) = limsupiJi^^ ^jjp. So it is natural to ask when 
the limit in Theorem 11.11 is positive. We prove the following theorem: 

Theorem 1.2. Let (X, X, /i, T) be a measure preserving system, A be a measurable 
set with fi(A) > and a G R\Q be an irrational number. Then the set of integers 
n such that 

n{A n T~ lna] A n • • ■ n T- [kna] A) > o 

has nonempty intersection with P+ 1, and with P — 1. The conclusion is also true 
if [ina] is replaced by i[na] for i — 1, . . . , k. 

We immediately deduce the following: 

Corollary 1.3. For any k G M, a G M\Q, every subset AcN with positive upper 
Banach density contains the following pattern: 

{m,m + [na] , . . . , m + [kna] } , 

for some m G N,n G P + l, and for some mGN,fiGP-l. The conclusion is 
also true if [ina] is replaced by i[na] for i = 1, . . . , k. 

Remark 1.4. It is worth noting that Theorem fails if P + 1 is replaced with 
P. For example, if (X, X , /i, T) is the 1-dimensional torus with Tx = x + 

mod 1) and A = (-1/8, 1/8), then /i(A n T~ [n ^A) > only if n is even. 
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The first result in this direction was obtained by Sarkozy [23], who proved the 
difference set of E — E for a set E with positive upper Banach density contains 
a shifted prime p — 1 for some p G P (and a similar result holds for p + 1). The 
linear polynomial case of Theorem 11.21 was proved in [8], with the proof of the case 
k > 3 conditional upon the results of [H] and [16] that were subsequently proven. 
Then Wooley and Ziegler [23] proved the recurrence result for the polynomial case 
for a single transformation. Then Bergelson, Leibman and Ziegler [5] proved the 
linear polynomial case of Theorem 11.21 for the multi-dimensional case. The multi- 
dimensional polynomial version of Theorem 11.21 for commutative transformations 
was proved by Frantzikinakis, Host and Kra [9]. 

The method used in proving Theorems 1 1 . 1 1 and 1 1 . 2 1 also applies to other results 
on shifted primes for cubes and certain weighted averages. We indicate how such 
results can be obtained in Section 5. 

I. 2. Strategy and Organization. The technique used in [9] was to compare 
the average under consideration along the shifted primes to the analogous average 
along all the integers. In order to obtain results for averages along [na] (Theorems 

II. 11 and ll.2p . we need to modify the technique. Roughly speaking, the difficulty 
is that the difference a(n + h) — a(n) is independent of n when a(n) = cn, c G N, 
but this is not the case when a{n) = [na]. To overcome this obstacle, we need 
to consider only those n such that {na} lies in some short interval to guarantee 
a(n + h) — a(n) is constant. 

The background material is presented in Section 2. Section 3 is used to obtain 
some estimate for Gowers norms that is used in later sections. We present the 
proofs of Theorems 11.11 and 11.21 in Section 4. Other applications of the estimate 
for Gowers norms are presented in Section 5. 

Ackonwledgment . The author would like to thank N. Frantzikinakis for helpful 
discussions related to the estimate for Gowers norms and the material in Section 
4, and B. Kra for all the instructive advices in preparation of this article. 

2. Background 

2.1. Notation. Throughout this paper, we denote the set of positive integers by 
N, the set of real numbers by R, and the set of rational numbers by Q. Write 
Z N = Z/NZ. When needed, the set Z N is identified with [N] : = {1, . . . , JV}. 

We use ojv(1) to denote a quantity that converges to when N — > oo and all 
other parameters are fixed. The expression a(n) <C b(n) stands for a(n) < Cb(n) 
for some constant C. If C depends on some parameter d, we write a(n) b(n). 

If A is a subset of a space X, we write l^(^) to be the index function of A, taking 
value 1 for x & A and eslewhere. We denote the space of continuous functions 
on X by C(X). For any complex- valued function g(n), denote Cg(n) = gin). US 
is a finite set and a: S — > C is a function on S, we write E ne 5a(n) = |-^yS ne 5a(n). 



4 



WENBO SUN 



2.2. Gowers norms. For any function a: Z^v — > C, we inductively define: 



\m(z N ) 



E 



and 



Mli/d+iC^jv) — ( ^hez N \\cthCi 



2 d \ l/2 d +! 



where a^(n) = a(n + /i). Gowers [12] showed that this defines a norm on functions 
on Z^v for d > 2. These norms were later used by Green, Tao and others in 
studying the primes (see, for example, [13], [U] and [15]). Analogous semi- norms 
were defined in the ergodic setting by Host and Kra [18]. 



2.3. Nilsequences and Nilmanifolds. Let G be a Lie group. Denote G\ = G, 
Gi + \ = [G,Gi] = {ghg^h^ 1 : g E G,h E Gi},i > 1. We say G is d-step nilpotent 
if Gd+i = {1} and the system (G/T, X, fM,T) is called a d-step nilmanifold, where 
r C G is a discrete cocompact subgroup, /i is the measure induced by the Haar 
measure of G, X is the Borel a-algebra and T is the transformation given by 
T{x) = gx for some fixed g G G. If F is a continuous function on a d-step system 
{G/T, X, /i, T), for any x G G/r, we say {F(T n x)} nf z^ is a 6aszc d-step nilsequence. 
A sequence is called a d-step nilsequence if it is the uniform limit of basic d-step 
nilsequences. 

Nilsequences were introduced by Bergelson, Host and Kra [3] and they play an 
important role in the study of multiple averages and in the estimate of Gowers 
norms (see, for example, [3j, p3], ^5\, [IS] [32], [H], [H], [21] and [2S]). As we 
need to be quantitative regarding these nilmanifolds, we endow each manifold with 
an arbitrary smooth Riemannian metric. We then define the Lipschitz constant of 
a basic nilsequence {F(T n x)} ng N to be the Lipschitz constant of F. Notice that 
the Lipschitz constant of a basic nilsequence {F(T n x)} n£ fq is independent of the 
transformation T. 

2.4. van der Corput Lemma. The use of the van der Corput Lemma in studying 
multiple averages in ergodic theory was introduced by Bergelson [TJ. In this paper, 
we use a variation of the estimation of van der Corput: 

Lemma 2.1. Let {v(n)}^ =l be a sequence of elements in a Hilbert space with norm 
|| • || and inner product (•,•). Then 

^ N , N ..„ ,7V , N-h 



71=1 n=l h=l n=l 



The proof of a special case of this lemma can be found in [20] and the proof of 
the general case is essentially the same. 
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3. Estimation of Modified von Mangoldt Function 

Throughout this paper, we let T = R/Z denote the 1 dimensional torus with 
Haar measure A, associated cx-algebra X and transformation R a (x) = x + a(mod 
1), and we call the system (T, X, A, R a ) the 1-dimensional torus. For convenience, 
we sometimes identify T with the interval [0, 1]. 

Let A: N — > R denote the von Mangoldt function, taking the value logp on the 
prime p and its powers and elsewhere, and let A'(n) = lp(n)A(n). We use the 
von Mangoldt function to replace the indicator function lp(n) since it has better 
analytic properties. We have the following lemma: 

Lemma 3.1. If a(ni, . . . , njt) : N fc — > C is bounded, then 



lim 



7T 



(NY 



a(pi,...,p fc ) - 



Kn 



,...,nh<N i=l 



0. 



The proof of the case fc = 1 can be found in for example [8], and the general 
case can be derived easily from the special case. 

Throughout this paper we denote W = H P £p, p<w p for w > 2, i.e. W is al- 
ways assumed to be an integer depending on w. For r G N, we always denote 
A«;,r( n ) = A'(VFn + r), where is the Euler function, and A' w r (n) is referred 
to as a modified von Mangoldt function. The key to the study of convergence and 
recurrence results along primes is the estimate of the Gowers norms of this modi- 
fied von Mangoldt function. The purpose for this section is to establish a variation 
of Theorem 7.2 of [IB] , which allows us to estimate not only the Gowers norms of 
the modified von Mangoldt function, but also that of the product of the modified 
von Mangoldt function and some "well-behaved" sequences. The method we use 
basically follows the one used in [13]. We prove: 

Proposition 3.2. (1) For any bounded basic k-step nilsequence {F(g n x)} nf z^ with 
bounded Lipschitz norm, both 



and 



sup 

l<r<w,(r,w)- 



sup 

l<r<w,(r,w)=l 



IhN] 



[n)F(g 



U k {Z kN ) 



(A; )r -l)l M] (n)F(M 



u k {i kN ) 



converge to as N — >■ oo and then w — > oo. 

(2) Let (T,X,X,R a ) be the 1-dimensional torus with a G 
an interval. Then for any xeT, both 



Let A e X be 



sup 

l<r<w,(r,w)=l 



(A' -l)l [MV] (n)l A (j£ 



X 



u k {i kN ) 



G 
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and 



sup 

l<r<w,(r,w)=l 

converge to as N 



(A' w>r -l)l [1>N] (n)l A (R n a x) 
— > oo and then w — >■ oo. 



Remark 3.3. Our definition of nilsequences is different from the one used in [13 
The basic nilsequence defined in this paper is called a nilsequence in [13] . 



Proof. For any 5 > 0, we can find two non-negative smooth functions Ff(x) < 
Ia(^) < F^(x) such that J T (F^— Ff)d\ < Here Cs,k is some constant to be 

chosen latter depending only on k and 5. We temporally write Gw(n) to represent 
one of the following functions: F(g Wn x), F(g n x), l^(i?^f n x), lA(R^ n x). Suppose 
the statement is not true for one of the functions Gw{n) listed above. Then 
there exists S > and sequences of integers {wat}7v g n, {nvjiveN, {Wjv}jveN of non- 
negative integers and IcNan infinite subset, such that Wn = Yl P eF P <w N Pi Wjsr = 
o(\ogN),r N is coprime with Wn, and Wn — > oo as N — > oo, and 



(a: 



w N ,r N 



l)l[ lt N]{n)G WN (n) 



u k (i kN ) 



> 5 



for all N & L Moreover, I can be chosen such that 



(1) 

and 

(2) 



1 ft AT 

n=l J 1 



< S'/8Cs tl 



1 kN r 

m Y,Fc(R>) - j r F!d\\ < 5'/8C s , k 



for c = I and w, iV e /(we can do so because the order of the choice is that first 
a sequence of w is picked and then we attach to each w with some N sufficiently 
large). Here 8' is some constant to be chosen latter depending only on 5 and k. By 
Propositions 10.1 and 6.4 of [32], (Proposition 10.1 of [13] was based on the inverse 
conjecture for the Gowers norms which was later proved in [TB]) there exists 5' > 
and a finite collection of (k — l)-step nilmanifolds U such that for anyiV G /, there 
exists a basic {k — l)-step nilsequence {F' N (h n x)} ne z from one of the manifolds in 
U with bound 1 and Lipschitz constant 0^(1) such that 



(3) 



E n < kN (A' Wtr - l)l [hN] {n)G WN {n)F' N {h n x) 



>S'. 



We now choose the 5' appear in (JTJ and (J2J) to be the same 5' appear in ([3]) (this 
is possible because 5' is independent of N), and choose Cg t k = Os,k(l) to be the 
uniform Lipschitz bound for all F' N , N 6 I. By passing to an infinite subset of 
/, we can assume all {F' N {h n x)} nl ^ comes from the same nilmanifold Y . We still 
denote this subset by I for convenience. 
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Case that Gw(n) = F(g Wn x) or F(g n x). In this case, for each N G I, {S^^n) : : 
Gw N (n)F' N (h n x)} n( z^ is a basic nilsequence from X x Y with a uniform Lipschitz 
bound. Then (j3J) contradicts Proposition 10.2 of [13]. 

Case that G w (n) = 1a(-R» or l A (it^ n x). We assume G w (n) = l^-R^x) 
since the proof of the other case is identical. We may assume without loss of 
generality that all functions F^(x), N G J are nonnegative (otherwise we can split 
F' N (x) as the difference of two nonnegative functions and use the argument for 
each one). Then 

®n<kN(A' w>r ~ l)l {1 , N] (n)l A (R^ n x)F' N (h n x) 

< ®n<w(K, r ( n ) F u(Ra Nn x) ~ F?(R^ x ))l [hN] (n)F' N (h n x) 

< E n < m (A' Wjr - l)l [ltN] (n)Fl'(R^ n x)F' N (h n x) 

+ Cs,kE n < kN (F^'{R^ n x) - Ff(R™» n x)). 

Similar to the discussion of the first case, by Proposition 10.2 of [T3J, the first term 
on the right hand side goes to as iV — > oo. For the second term, by (TjQ) we have 

C d , k E n < kN (F*(R^ x ) - Ff(R*» n x)) 
<C 5 , k f{F* -F?)d\ + 5'/A 
< S'/2. 

Thus the left hand side of (pEJ) is less than 5'. Similarly, the left hand side of (jlj) is 
greater than —5'. This contradicts (J3j). This finishes the proof. □ 

4. Convergence and Recurrence along Shifted Primes on [na] 

4.1. Comparison with Averages along Integers. The following proposition 
shows that the weighted averages along some properly chosen subsets of [1, N] for 
[na] is controlled by the Gowers norms of a related function: 

Proposition 4.1. Let a G R\Q. Let a^n) = [ina] or i[na). Let (X, X, fJ,,T) 
be a measure preserving system and fo, fi, ■ ■ ■ , fk £ L°°(fj,). Let b(n): N — > C 
be a sequence of complex numbers satisfying b(n)/n c — > for all c > 0. Let 
: T — > R be a bounded function whose support is contained in an interval that 
does not contain a point , where < i < (k + 1)!. Then 

ii 1 - 

Kn)^{na})f T a ^h • . . . ■ T a ^ f k 

(5) " 7V n=i 



b(n)£({na}) ■ l [ltN] 



+ o N (l) 

u k {i kN ) 



Furthermore, the implicit constant is independent o/{6(n)} nS N and the oat(1) term 
depends only on the integer d and {&(n)} ne p*j. 
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Remark 4.2. The function i[na] is generally easier to treat than [ina], and we 
can replace (k + 1)! with 2 in the first case. But we write them together since the 
proofs are similar. 

Proof. Without loss of generality, we assume that all functions fa are bounded by 
1. Before we prove the general case, we give an example for the case k = 1 which 
is easier and illustrates the main idea. 

By Lemma 12.14 the Cauchy-Schwartz inequality and the invariance of T, 



1 N 



n=l 
N N-h 



4 



« It Yljj E Knm{na})b(n + h)£({(n + h)a})ftT 



N^WN 

h=l n=l 



[(n+h)a] — [na] y 



IP (ft) 



+ N (1) 



where on(1) depends on {b(n)} n£ fq. Suppose is supported in an interval 
/ C (0, 1), |/| < 1/2. Notice that [(n + h)a] - [na] = ha - {(n + h)a} + {na}. 
Thus if {na}, {{n+h)a} G I, then {ha} < 1/2 implies [(n+/i)a] — [na] = [ha], and 
{ha} > 1/2 implies [(n + h)a] - [na] = [ha] + 1. Let A N = {h e [1, N] : < 
1/2}. Then by splitting the summand of h along [1, N] into A N and A^, we get 



N „ JV-h 



h tlli E & (n)e(W)6(n + fe)£({(n + /iM^T^H-M^ 



h=l n=l 
AT-h 



^ 4 E I Tj E 6(")e({na})6(n + /*)£({(" + /»)«}) 



AT ^ \N 

heAff n=l 



N-h 



+ Jf E | ^ E W)6(" + + &M) 



l 2 M 



fc(n)£({na}) • l^jJ 



U 2 (Z 2N ) 



The last inequality follows because the left hand side is bounded by a multiple of 
right hand. This finishes the proof of the case k = 1. 

We now give the proof of the general case. Write ao(n) = 0. Denote g(n) = 
b(n)£({na}) and g(n; hi,..., h t ) = ILe{o,i}* C lel g(n + Y!j=i e A) (Recall Cf(n) = 

7R). 
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Claim, for any < t < k, there exists a function F(-; hi,..., h t ) G L°°(ji) 
bounded by 1 (this function depends on hi, . . . , h t but not on n) such that 



1 - 

\-J2g(n)f T a ^f 1 -...-T a ^f k 



n=l 



2 fc+i 



j j JV-hi ftt 

«^ E at E 9{n;hi,...,ht) 



l<h lr ..,ht<N 



n=l 



F{-;hi,...,h t ) Yl J J T a *(«+Ei=i^ h i)-°«(")cl e l/ < 

i=t+l ee{o,i}< 



jfc+l-t 



L2( M ) 



+ Ojv(1). 



If the claim is true, let t = k, we get the left hand side of ([5} is bounded by 

iV-fti 

Art E | at E 



l<hx,...,ht<N 



n=l 



which is exactly the right hand side of (J5J) since this is the expansion of the Gowers 
norm || ■ \\u k (z kN ) °f 5 l ( ?7 -)l[i,A r ]( n )- m order to prove the claim, it suffices to show 
for any 1 < t < k, any F'{-; hi, . . . , ht-i) G L°°(fi) bounded by 1, there exists 
F(-; hi, ... , h t ) G L°°{n) bounded by 1, such that 



(6) 



JV-fti ht-i 

E g(n;h h ...,h t -i) 

71=1 

F'(- ; ^i,---,^-i)n n r a ' {n+E ^i^^ ) - at - i(n) c |e| / i 

1^,1 ^-^1 ft i 

^-atEII-aT E 9{^h u ...,ht) 



2^+2— ( 

L2( M ) 



n=l 



F{--hi,...,h t ) Yl ] | T°^ n+ ^-=i e ^^- a *( n )C |e| /j 

i=t+l ££{0,1}* 



2 fc+i 



L2( M ) 



+ 07V(1) 



for any 1 < hi , . . . , ht-i < N. By Lemma 12.1 1 the Cauchy-Schwartz inequality 
and the invariance of T, the right hand side of (jSJ) is bounded by ojy(l) plus 



AT iV-/u 



vrEll^ E 



iV ^ IJV 

iit=l 



, . . . , n t) 



n=l 



rpa i (n+Y 1 t :j=1 ejhj)-at{n)+at-i(n+etht)-at-i(n)^\e\ j; 

i=t e£{0,l} f 



fk + l-t 
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where ojv(1) depends on {b(n)} n ^ (each function is translated by j ,a t-i(«)- a *( n )). 
So in order to prove the claim, it suffices to show if g(n; hi, . . . , h t ) 7^ 0, 

p _ 2 n at(n+E* =1 ej/ij)-a t (n)+a t _i(n+eih. t )-a t _i(n)^|e|y 

££{0,1}* 

is a function bounded by 1 and is independent of the choice of n. It suffices to 
show for any e G {0, 1}*, if g(n; hi, ... , h t ) ^ 0, then value of 

t 

(7) a v (n + 2_. t jhj) - a v (n) 

3=1 

is independent of n, where 1 < v < k. Suppose is supported in an interval 
JcT, (fc+i)! ^ ^ for all < i < (k + 1)!. We discuss case by case: 
Case that a^n) = [wia]. Notice that 

t t r r— 1 

+ ^ ^AH - [vna] = J2([v(n + ^ e^a] - [v(n + ^ e^Hj . 

j=l r =l j=l i=l 

If g(n; hi, . . . ,h t ) ^ 0, then for each 1 < r < t, we must have {(n + YTj=i £jhj)a}, 

{( n + Y!j=\ £jhj)a} G /, so {v((n + YJ j= i e A)) a }> M n + Ej=l e A) a } e w/: = 
{fx: s G /}. By the property of /, the length of vl is less than and vl does 
not contain any point ^tj(0 < s < f + 1). Thus it is easy to verify that 

r r— 1 

[f (n + y^jh^a] - [v(n + y^jh^oc] = e r ([vh r a\ + D v (h r )), 

3=1 3=1 

where D v (h r ) is the unique integer such that {h r a} G ( D ^?^\ D "^ +1 ). This 
means (j7j) is independent of n if #(n; hi, ... , h t ) ^ 0. 
Case that a^n) = u[na]. Notice that 



t t r r— 1 

^[(^ + e J^) a ] — = v^2([(n + ^2ejhj)a} - [{n + ^ejh 3 )a\ 

j=l r =l jf=l j=l 

. ,nt) ^ u, men ior eacn i ^ r ^ i, we musx nave ^^"1" Z? T 



If #(n; hi, ... , h t ) ^ 0, then for each 1 < r < t, we must have {(n + J^ =1 £jhj)a}, 
{{n + Ylj=i e jhj)a} G /. By the property of /, the length of I is less than | and 
/ does not contain any point |(s = 0, 1). Thus it is easy to verify that 

r r—l 

[(n + ^ eAH - [(n + ^ ^AH = e r ([h r a] + D(h r )), 

3=1 3=1 

where D{h r ) is the unique integer such that {h r a} G ( jP ^ r ' > , D ^^ +1 ). This means 
([7]) is independent of n if hi, . . . ,h t ) ^ 0. This finishes the proof. □ 
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4.2. Proof of Theorem 11.11 We prove Theorem 11.11 in this subsection. 

Definition 4.3. Let (X, X , fi,T) be a measure preserving system. A factor Z is 
called a characteristic factor, or characteristic, for the family of integer sequences 
{ai(n), . . . , dfc(n)} ; if for any fi, ■ ■ ■ , fk £ L°°(fj,),at least one of which is orthogonal 
to Z, the average 

i N 

J^T^ h(x) ■ . . . ■ T a *W f k (x) 

n=l 

converges to in L 2 (fi) as N — > oo. 

The following theorem proved plays an important role in the study of the char- 
acteristic factor: 

Theorem 4.4. (Host and Kra, [IB]j For any k £ N and any measure preserving 
system, there exists a factor Z k which is characteristic for {n,2n, . . . , kn}, and 
a.e. ergodic component of Z k is an inverse limit of k- step nilsystems. 

We refer the reader to [32] for details about characteristic factors. The following 
proposition can be deduced directly from Proposition 4.1 of [7]: 

Proposition 4.5. Let (X, X , fi,T) be a measure preserving system. Let a £ 
M\Q, a, b £ Z. Then there exists I £ N such that the factor Z\ in Theorem \4-4\ is 
characteristic for both {[(an + b)a], . . . , k[(an + b)a]} and [(an + b)a], . . . , [k(an + 

&)«]}■ 

Remark 4.6. Proposition 4-1 of [7j is stated for the case {[(an + b)a], . . . , k[(an + 
b)a]} but the proof of the case [(an + b)a\, . . . , [k(an + b)a]} is included in the proof. 

Proposition 4.7. Let a £ M\Q, a,b £ Z. For any nilmanifold X = G/Y, with G 
connected and simply connected, any g £ G,x £ X and any Fi(x), . . . ,F k (x) £ 
C(X), the limit 

lim — V F 1 (g^ an+b)a] x ) • . . . • F k (g k[{an+b)a] x ) 

71=1 

exists. The conclusion still holds if i[na] is replaced with [ina] for i = 1, . . . , k. 

Proof. The case a = is trivial, so in the rest of the proof we assume o ^ 0. 
Without loss of generality, we assume xq = Y. The case {[not], . . . , fc[na]} is 
Lemma 4.7 of [6]. So we only need to prove for the case {[tick], . . . , [fcna]}. Let 
X = G/Y, where G = R k x G k , f = Z fc x Y k and g = (1, . . . , 1, g,g 2 ,..., g k ). Then 
G is also connected and simply connected. It is well known that if G is connected 
and simply connected, the exponential map exp(x) : g — > G is a bijection, where q 
is the Lie algebra of G. So for any t £ K, we may define g t = exp(tA), where A £ g 
is the unique element such that g = exp(A). We define a function F: X — > C by 

F(t,Z, t k Z, 9l Y, . . . , ^r) = F 1 (g^g 1 Y) ■ F 2 (g~^g 2 Y) • . . . ■ F k (g-^g k Y). 
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(We caution the reader that F may not be continuous.) Notice that 

= Fi(g^ an+bS)a ^T) ■ Ffc(y' fc(cm+f, ) a 'r), 
so it suffices to show the limit 

1 - 

lim — X" p(g( an+b)a f) 

n=l 

exists. For any 5 > (and sufficiently small) there exists a function i 7 ^ 6 C(X) 
that agrees with F on X$ = Is x X, where 7,5 = {(tiZ, . . . , t^Z) : maxo<j<i_i — 
f|| > <M = 1, •••>&}, an d is uniformly bounded by 211-FHoo. Denote J = {tZ : 
max <j<j<A;||t - f|| > 5}. If t G J , then (t,...,t) G Is- Since ({(an + b)a}) neN is 
uniformly distributed on T, the density of the set of n in {1, . . . , N} (as N — > oo) 
such that F(g( an+ Q a f) ^ F 5 (y( an+b ) Q f) is at most 1 - |/ | = C k 5 for some C k > 
depending on /c. So 

' < 4L7JIFIL. 



(8) lim sup — V|F(y (an+fe)Q f ) - F 5 (y (an+6)Q f) 

n=l 

By Theorem B of [22], the limit 

1 N 

lim — Fs{g (an+h)a V) 

n=l 

exists. It is then easy to deduce from (jSj) that {-^ X^=i ^(y^^^f )}iveN is a 
Cauchy sequence. So the limit exists. □ 

With the help of all the above material, we are now able to prove the following 
multiple convergence result: 

Proposition 4.8. Let a G K\Q, k G N. For any measure preserving system 
(X, X, fi, T) and any fi, ■ ■ ■ , fk £ L°°(fi), the average 

1 N 

— J2 Ti{an+b)a] fi(x) ■■■■■ T k ^ an+b ^f k (x) 

n=l 

converges in L 2 ([i) as N — )■ oo /or a// integers a,b. The conclusion also holds if 
i[(an + b)a] is replaced with [i(an + b)a] for % = 1, . . . , k. 

Remark 4.9. It is worth noting that the case {[na], . . . , fc[na]} is proved in The- 
orem 2.1 of [Z] . 

Proof. The method follows form [6] and [7]. We can assume /j, 1 < i < k are 
continuous by an approximation argument. By Proposition 14. 5[ we can replace the 
original space with Z\ for some I G N. Using an ergodic decomposition argument, it 
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suffices to prove this proposition when the system is an inverse limit of nilsystems. 
By an approximation argument, we can simply consider the case when the system 
X = G/Y is a nilsystem. 

We further simplify our discussion as follows (see |22J): since the average we deal 
with involves only finitely many actions on X, we may assume the discrete group 
G/Gq is finitely generated, where Go is the connected component of G containing 
the unit element idc- In this case one can show that X is isomorphic to a sub- 
nilmanifold of a nilmanifold X = G/Y, where G is connected and simply connected 
and for any F(x) G C(X), b G G, x G X, there exists F(x) G C(X), b G G,x G X 
such that F(b n Xo) = F(b n xo) for any n G N. So it suffices to consider the case when 
G is connected and simply connected, and the conclusion follows from Proposition 

E3 □ 

Proof of Theorem M . 1\ By Lemma I3TT] it suffices to prove the convergence in L 2 (fi) 
for the corresponding averages 

1 - 

A(N) = ^J2 ^'(n)T {na] fi • • • • ■ T k ^f k . 



n=l 



Equivalently, it suffices to show that the sequence of functions {A(iV)}7v e N is 
Cauchy in L 2 (/x). Let e > 0. Fix w,r G N and let 

1 - 

(iV) ^ ^ rp[(Wn+r)a] _ _ rpk[(Wn+r)na] 



n=l 



Let 



1 - 

Ai (N) = -^A'He ! (W)TH/ 1 ...T fc H /fc 



n=l 



1 N 

B lw {N) = - U{™})T [{Wn+r)a] fi ■ ■ ■ T h ^ Wn+r ^f k 



n=l 



where ^i(x) = 1^ »-i i-i By Propositions 13.21 and 14. 1[ we have that for any 



« (fc+i)! ' (fc+i) 

Wq G N (and corresponding Wq G N) large enough, if N is large enough, then 



A(W N) 



HW 



< 



l<r<W ,(r,W )=l 



L*{») ~ (k + l)V 



Thus 



A{W " N) ~ MWo) ^ B "" (N) 

l<r<W ,(r,W )=l 



L2(„) 



< e, 



when N is large. By Proposition [4TH1 for r = 1, . . . , W , the sequence {B W0)T .(N)}^ eK 
converges in L 2 (fi). Therefore, if N' and N are sufficiently large, then for r = 
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1, . . . , Wq we have 



B WOjr (N) - B WOtr (N') 



La GO 



< e. 



Thus if N is large enough, 
A(W N) - A(W N' 

Notice that for r = 1, . . . , W 



< 



B WOtr (N) - B mhT (N') 



L 2 (P) 



+ 2e < 3e. 



(9) 



lim 



A(W N + r) - A(W N) 



L2( M ) 



0. 



Thus if N and N' are sufficiently large, 



(10) 



A(N) - A(N') 



L 2 M 



< 4e. 



Therefore the sequence {^(AQj-jveN is Cauchy and this finishes the proof. 



□ 



4.3. Proof of Theorem 11.21 We prove Theorem 11.21 in this subsection. The 
following proposition is from Theorem 2.1 of [3]: 

Proposition 4.10 (Bergelson, Host, McCutcheon and Parreau, j3]). For any 5 > 

0,fceN, there exists c(5), N(8) > 0, such that for any probability system (X, X, (j,), 
any commuting measure preserving transformations Ti, . . . , Tk, and any fi(A) > 5, 
there exists < n < N(S) such that 

fi{An T{ n A n • • • n T- n A) > c{5). 
We need to use the following uniform multiple recurrence result along integers: 

Proposition 4.11. For any 5 > 0, k G N, there exists c(S) > such that for any 
a G M\Q ; any measure preserving system [X,X,fi,T) and any fi{A) > 5, we have 

liminf -V/ifin T- [na] A fl • • • fl T~ k[na] A) > c(S). 



n=l 



The conclusion also holds if i[na] is replaced with [ina] for i — 1, . . . , k. 

Remark 4.12. Notice the c(5) in this theorem does not depend on a. 

Proof. The proof follows the method used in [5]. Denote q(ri) = [na],p m (n) = 
[ma]n. For any r G N, denote S r = {in G N: {ma} < 1/kr}. It is easy to 
see if m G S r , then q(inm) = p m (in) = iq(nm),l < n < r, 1 < i < k. Let 
c(6),N(6) > be the same as Proposition 14.101 and let Tj = T^ ma \ Then there 
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exists 1 < n < [N{8)\, such that /i(Anr p ™( n Un- • • nT- pm( - kn ^A) > c(5). Clearly, 

N 

'"v'-'x" .Y 



liminf — V u(A n T- q{n) n • • - n T- 9(fcn) A) 

n=l 

[-] 

> liminf -V/i(An r-ff(*»«) n • • • n r- 9(femn) A), 



m=l 

for any nGN. Notice that the set 5 , [jy(5)] has density 1/[JV(5)], thus 

AT 

"v-'x* A" 



1 N 

lim inf — V u(A n T- ?(n) n • • • n r- 9(fcn) A) 



?1=1 



> — - — liminf — V V uM n T~ 9(mn) Pi - - - Pi T-«( fcm ™)yl) 
~ \N(6)] w^oo AT ^ ^ v 7 

L m=l n=l 



> — - — liminf — V V ufi n r~ Pm(n) A n • • • n T~ Pmikn) A) 

- \N(S)] N-Hx> N ^— ' ^— ' 7 

"■eVwil 1 Im" n=1 



J . . . J 



> — M — > o. 

- fc[iy(«5)] 3 

This finishes the proof of the case {[na], . . . , [fcna]}. The proof of the case {[na 
k[na}} follows by replacing q(in) with iq{n). □ 

Proof of Theorem \1.2\ We only need to prove the situation for P— 1 since the other 
case is similar. By Proposition 14. 1H there exists a positive constant c depending 
only on /Jj(A) such that for any W > 

(fc+l)! N 

V lim sup — V £i({nWa})fi(A n T~ [nWa] A n • • • D T- [/cnWa U) 

i=l ^ n=l 

1 - 

> lim sup — ^ //(A n r- [nWa] A n • • • n r- [fcnH/Q U) > c, 



V-S'oo 



n=l 



where = \, <-i . So one term on the left hand side is larger than 

Tj^jM- Suppose this is the term corresponding to i G {1, . . . , (k + 1)!}. 

If we apply Proposition 14.11 with Wa, fo — • • ■ — fk — 1a, b(n) = A' wl (n) — 
l,£(x) = £i(x) and use the fact that when w (and corresponding W) large enough, 
|| {A.' wl (n) — l)£i({nWa}) ■ 1[i,a^] 1 1 t/ fc (^ feA r) can ^ e sufficiently small as iV — > oo (by 
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Proposition I3.2p . we can deduce that if w is large enough, then 
1 - 

limsup -VAL x (n)&{{nWa})ii{A n T~ [nWa] A n • • • n T- [fcnH/Q U) > 0. 

n=l 

This finishes the proof of the case {[na], . . . , [fcna]}. The proof of the case {[na], 
. . . , fc[n«]} can be obtained by replacing T^ nWa ^ with T^ nWo ^ in the discussion. □ 

5. Other Applications 

5.1. Cube Averages along Shifted Primes. Every element e G {0, l} d is iden- 
tified as a sequence e = (ex, . . . , e d ), e { G {0, 1}. Write {0, 1}* = {0, l} d \{(0, . . . , 0)}. 
The convergence result along cubes along integers was obtained in [19]. We indi- 
cate how the same method used for proving Theorem 11.21 can be applied for the 
averages along cubes along primes: 

Theorem 5.1. For any k G N, any measure preserving system (X, X,[i,T) and 
any f e G L°°(//),e G {0, 1}* functions bounded by 1, the average 

_ ST^ TT T^PieiH t-Pfctfc £ 

n(N) k 11 1 J* 

y ' Pi,-,Pfee[i,JV]nPee{o,i}S 

converges in L 2 (fi) as N — )■ oo. 

Theorem 5.2. Let V = P + 1 or P — 1. For any sequence of integers {WjvliVeN 
TOi/i lim7v-s>oo Wjv = oo, Wjv = o(k>g X) ; any fceN, any measure preserving system 
(X, X, fi, T) and any A G X, (i(A) > 0, we have 

liminf — —r V u( Pi T-^ (ni£1+ - +nfcefc) ^ > 0. 
iv^oo vr(A) fc V ' ' / 

v ' ni,...,n t ePn[l,iV] ee{0,l} fc 

Remark 5.3. It is not known whether one can drop the factor Wn in this theorem 
(i.e. Wn — 1). Unlike Theorem \1.2\ where we only require the set A to return 
for a single time, Theorem \5.S\ concerns how often the set A recurs. The reason 
we need Wn is that Proposition Iff.ffl is the estimate for the modified von Mangoldt 
function, and we do not have an estimate for the ordinary von Mangoldt function. 

We outline the key ingredients for the proofs. 

Proposition 5.4. Let fceN, (X, X,/i, T) be a measure preserving system, f e G 
L°°(/i),e G {0,1}^ be functions bounded by 1. Let &i(n), . . . ,bk(n): N — > C be k 
sequences of complex numbers satisfying bi(n)/n c — >■ for all c> and 
J! Yju=i \ b *( n )\ < U 1 < * < fy. Then for any 1 < j < k, 

k 

W k E II W II u « d h ■ i [m + o N (i). 



N k 

l<n 1 ,...,n k <N i=l e£{0,l}J 



U 2 (Z 2 n) 
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Further more, the implicit constant is independent of {bi(n)} neN , . . . , {bk(n)} n£ ?q 
and the o N (l) term depends only on {&i(^)} n eN ; • • • ? {&fc(^)}neN- 

Remark 5.5. Proposition 7.1 of [T3] shows that this proposition is true for X = 

Proof. We may assume j — k since it is the same for other cases. Fix f e L 2 (fi) 
with norm 1. Denote 

1 N 

M (^,...,n fc _ 1 ) = -^6 1 (n 1 )-...-6 fc K) J] T ^+-+^f e . 

n k =l ee{0,l} k 

We only need to show 

k 



Lw e n ".(«,) n t *-' 

1 i<m,...,n fc <JV i=i ee{o,u fe 



&fc • 1[1,JV] 



.,n fc SJV i=l ££{0,1}* 

Let g' e = f e0 ,g e = f el . Then 

k 



U 2 {1 2 n) 



o N (l). 



Lw: e n •'■>-• 

1 l<ni,...,n k <N i=l ee{0,l} k 

1 fc-1 1 AT 



l<ni,...,nfe_i<A r i=l 

By Lemma [2. 1[ 

A? 



1=1 eejO,!}*- 1 



L 2 (p) 



1=1 ee{o,i} fc - 1 

N „ AT-/l 



« JJ E^ E / + *) II rSU^Oyrt 

h=i n =l ^ eelojp- 1 

^El^rE^( n ) 6fc ^ + /i ) 



h = l 71=1 



ee{o,i} 
+ ojv(1) < 

The proposition follows from the fact that 

1 fc-i 

e in* 



bk ■ 1[1,AT] 



f/ 2 (Z 2iv ) 



+ Ojv(1) 



+ o N (l) 



N k-1 



l<ni,...,rik_ 1 <N i=l 



converges. 



□ 



As the detail of the proofs of Theorem 15 . 1 1 and 15.21 are similar to (in fact, easier 
than) that of Theorems 11.11 and II. 2[ we just indicate the main steps. By using 
Lemma 13.11 and corresponding convergence and recurrence results along integers 
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(Theorems 1.2 and 1.3 of [IB]), it suffices to show that the difference between the 
averages along primes and that along integers are small. And this is achieved by 
the upper bound obtained in Proposition 15.41 and the estimate of Gowers norm in 
Proposition 13.21 We left the details to the reader. 

5.2. Weighted Averages along Primes. We indicate how the same method 
used for proving Theorem 11.21 can be applied to generalize Theorem 2.24 of [TH] to 
primes: 

Theorem 5.6. Let a{n) be a bounded k-step nilsequence. Then for any measure 
preserving system (X, X,n,T) and any fi, ■ ■ ■ , fk € L°°(fi), the average 

E a{p)T^h{x).....T^f k {x) 

^ ' P e[i,JV]nP 

converges in L 2 (fi) as N — > oo. 
We outline the key ingredients for the proof. 

Proposition 5.7. Let (X, X, /i, T) be a measure preserving system and /i, •••,/& £ 

Let b(n) : N — > C be a sequence of complex numbers satisfying b{n)/n c — > 
0,Vc> 0. Then 



1 \ - 



N 

kn . 



n=l 



U k (1 kN ) 



Further more, the implicit constant is independent of {b(n)} ne ^ and the 0^(1) 
term depends only on the integer k and {fr(n)} n£ N. 

This proposition is proved in The proof is similar and actually easier than 
the proof of Propositions 14.11 and I5.4L 



As the detail of the proofs of Theorem 15.61 is similar to (in fact, easier than) 
that of Theorem II. 2[ we just indicate the main steps. By using Lemma 13.11 and 
the corresponding convergence result along integers (Theorem 2.24 of [E]), it 
suffices to show that the difference between the averages along primes and that 
along integers are small. And this is achieved by the upper bound obtained in 
Proposition 15.71 and the estimate of Gowers norm in Proposition 13.21 We left the 
details to the reader. 
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